Reservoir risk modelling using a hybrid approach based on the feature selection technique and ensemble methods
نویسندگان
چکیده
Flash flooding is a type of global devastating hydrometeorological disaster that seriously threatens people’s property and physical safety, as well the normal operation water conservancy facilities, such reservoirs, so an accurate assessment reservoir risk for certain areas necessary. Therefore, purpose this study was to propose novel methodological approach modelling based on feature selection method (FSM) tree-based ensemble methods (Bagging Random Forest [RF]). The results showed that: (1) J48-GA models achieved higher learning predictive capabilities compared conventional without FSM. (2) For classification accuracy, J48-GA-RF (96.4%) outperformed RF (96.0%), J48-GA-Bagging (93.9%) Bagging (93.5%). And highest prediction AUC value (0.995), almost perfect Kappa indexes (0.926) best practicality (30.88%). (3) In particular, indicated all high performance, both in training validation dataset. Additionally, could provide reference managers, hydraulic engineers policy makers implement location-specific flash flood reduction strategies.
منابع مشابه
A Classification Method for E-mail Spam Using a Hybrid Approach for Feature Selection Optimization
Spam is an unwanted email that is harmful to communications around the world. Spam leads to a growing problem in a personal email, so it would be essential to detect it. Machine learning is very useful to solve this problem as it shows good results in order to learn all the requisite patterns for classification due to its adaptive existence. Nonetheless, in spam detection, there are a large num...
متن کاملAn Ensemble Based Approach for Feature Selection
This paper proposes an ensemble based approach for feature selection. We aim at overcoming the problem of parameter sensitivity of feature selection approaches. To do this we employ ensemble method. We get the results per different possible threshold values automatically in our algorithm. For each threshold value, we get a subset of features. We give a score to each feature in these subsets. Fi...
متن کاملA Novel Ensemble Classifier based Classification on Large Datasets with Hybrid Feature Selection Approach
Exploring and analyzing large datasets has become an active research area in the field of data mining in the last two decades. There had been several approaches available in the literature to investigate the large datasets that comprise of millions of data. The most important data mining approaches involved in this task are preprocessing, feature selection and classification. All the three appr...
متن کاملConstructing response model using ensemble based on feature subset selection
In building a response model, determining the inputs to the model has been an important issue because of the complexities of the marketing problem and limitations of mental models for decision-making. It is common that the customers’ historical purchase data contains many irrelevant or redundant features thus result in bad model performance. Furthermore, single complex models based on feature s...
متن کاملA Novel Scheme for Improving Accuracy of KNN Classification Algorithm Based on the New Weighting Technique and Stepwise Feature Selection
K nearest neighbor algorithm is one of the most frequently used techniques in data mining for its integrity and performance. Though the KNN algorithm is highly effective in many cases, it has some essential deficiencies, which affects the classification accuracy of the algorithm. First, the effectiveness of the algorithm is affected by redundant and irrelevant features. Furthermore, this algori...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Geocarto International
سال: 2021
ISSN: ['1010-6049', '1752-0762']
DOI: https://doi.org/10.1080/10106049.2020.1852615